Paul Holmes-Higgin and Khurshid Ahmad, University of Surrey Assembling and viewing a corpus of texts: self-organisation, logical deduction and spreading activation as metaphors

نویسندگان

  • Paul Holmes-Higgin
  • Khurshid Ahmad
چکیده

Lexicographical and terminological work is increasingly dependent on the analysis of texts, particularly texts organised in a corpus and being made available through computer systems. We argue that the developments in corpus linguistics, artificial intelligence, connection sciences, and lexicography and terminology, can be conjoined together to analyse the various facets of a text. In particular, the users of corpora will be allowed to explore the ‘family resemblance’ of such texts with other texts. This will help in the creation of a user-defined corpus of texts that belong to a family, all having their own idiosyncrasies but all sharing something through a common ‘genetic’ pool. Our approach, a strictly computational account of corpus organisation and usage, will help corpus builders and end-users to incorporate as much as is known about the texts in general and whatever is known about the contents in the description of texts for storage and for retrieval. We demonstrate our interdisciplinary approach by describing how texts in a computer-based corpus can be (a) represented by using knowledge representation formalisms, such as frames, (b) automatically classified by using self-organising artificial neural networks, and (c) managed by using a hybrid representation scheme wherein interactive activation and competition networks are used in conjunction with frames and deductive data bases.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

P 99: Self-Assembling Peptide Scaffolds as New Therapeutic Method in TBI: Focused on Bioactive Motifs

Traumatic brain injury (TBI) is a common reason of brain tissue loss as a result of tumors, accidents, and surgeries. Renewal of the brain parenchyma is restricted by many reasons such as inimical substances produced as the result of trauma and also inflammatory responses. A strong cascade of inflammatory responses begins as a result of TBI which include recalling peripheral leukocytes into the...

متن کامل

Conceptual Metaphoric Language Use in Structuring Political Discourse in Iran-West Relations: A CDA Perspective

The present study was carried out with the purpose of examining the role of metaphorical language in the critical discourse analysis (CDA) of political texts based on a modern framework postulated by Kövecses (2015). The corpus of the study consisted of thirty-thousand words chosen as a textual sample to see which source conceptual domains are used and what generic/discursive attributes emerge ...

متن کامل

A Cognitive Study of Conceptual Metaphors in English and Persian: Universal or Culture-Specific?

In the last 2 decades, studies on conceptual metaphors have profoundly increased. The development in this field was followed by Lakoff and Johnson's (1980b) work on describing the conceptual role played by metaphors and their correspondence with language and thought. This study aimed to compare conceptual metaphors in Persian and English through a corpus-based approach as well as examining both...

متن کامل

A Description of Texts in a Corpus: 'Virtual' and 'Real' Corpora

The extensive use of computer-based corpora for a range of language studies has led to the proliferation of the ways in which texts within an individual corpus are organised Basically, the organisation reflects the immediate needs of a group of well motivated users, like lexicographers or terminologists. This means that the subsequent generation of corpus users are forced to use a classificatio...

متن کامل

University of Surrey Participation in TREC8: Weirdness Indexing for Logical Document Extrapolation and Retrieval (WILDER)

This paper describes the development of a prototype document retrieval system based on frequency calculations and corpora comparison techniques. The prototype, WILDER, generated simple frequency information based on which calculations of document relevance could be made. The prototype was built to allow the University of Surrey to debut in the U.S. Text Retrieval Competition (TREC). User querie...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007